Dissimilarity Plots: A Visual Exploration Tool for Partitional Clustering

Authors

  • Michael Hahsler
  • Kurt Hornik
Abstract

For hierarchical clustering, dendrograms provide a convenient and powerful visualization. Although many visualization methods have been suggested for partitional clustering, their usefulness deteriorates quickly with increasing dimensionality of the data, and/or they fail to represent structure between and within clusters simultaneously. In this paper we extend (dissimilarity) matrix shading with several reordering steps based on seriation. Both methods, matrix shading and seriation, have long been well known. However, only recent algorithmic improvements make seriation feasible for larger problems. Furthermore, seriation is used in a novel stepwise process (within each cluster and between clusters), which leads to a visualization technique that is independent of the dimensionality of the data. A major advantage is that it presents the structure between clusters and the micro-structure within clusters in one concise plot. This not only allows cluster quality to be judged but also makes mis-specification of the number of clusters apparent. We give a detailed discussion of the construction of dissimilarity plots and demonstrate their usefulness with several examples.
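The stepwise reordering described in the abstract (first between clusters, then within each cluster) can be illustrated with a minimal sketch. This is not the authors' implementation: the nearest-neighbour chain in `greedy_order` is a simple stand-in for a real seriation method, the average-linkage definition of between-cluster dissimilarity is an assumption, and all function names and the toy data are hypothetical.

```python
import math


def pairwise_dist(points):
    """Full Euclidean dissimilarity matrix as a list of lists."""
    return [[math.dist(p, q) for q in points] for p in points]


def greedy_order(items, dist):
    """Nearest-neighbour chain over `items`.

    Stand-in for a seriation heuristic (assumption): start at the first
    item and repeatedly append the closest remaining item.
    """
    remaining = list(items)
    if not remaining:
        return []
    order = [remaining.pop(0)]
    while remaining:
        last = order[-1]
        nxt = min(remaining, key=lambda j: dist(last, j))
        remaining.remove(nxt)
        order.append(nxt)
    return order


def dissimilarity_plot_order(D, labels):
    """Return (object order, cluster order) for a reordered, shaded matrix."""
    clusters = sorted(set(labels))
    members = {c: [i for i, l in enumerate(labels) if l == c] for c in clusters}

    def between(a, b):
        # Assumed between-cluster dissimilarity: mean pairwise dissimilarity.
        total = sum(D[i][j] for i in members[a] for j in members[b])
        return total / (len(members[a]) * len(members[b]))

    # Step 1: place similar clusters next to each other.
    cluster_order = greedy_order(clusters, between)
    # Step 2: order the objects inside each cluster.
    order = []
    for c in cluster_order:
        order.extend(greedy_order(members[c], lambda i, j: D[i][j]))
    return order, cluster_order


def reorder(D, order):
    """Permute rows and columns of D by `order`."""
    return [[D[i][j] for j in order] for i in order]


def shade(D, chars="#*:. "):
    """Text rendering of matrix shading: darker glyph = smaller dissimilarity."""
    mx = max(max(row) for row in D) or 1.0
    rows = []
    for row in D:
        idx = (min(int(d / mx * len(chars)), len(chars) - 1) for d in row)
        rows.append("".join(chars[k] for k in idx))
    return "\n".join(rows)


# Toy data: three clusters on a line, with interleaved labels.
points = [(0, 0), (10, 0), (5, 0), (0.5, 0), (10.5, 0),
          (5.5, 0), (1, 0), (11, 0), (6, 0)]
labels = [0, 1, 2, 0, 1, 2, 0, 1, 2]

D = pairwise_dist(points)
order, cluster_order = dissimilarity_plot_order(D, labels)
print(shade(reorder(D, order)))
```

In the printed matrix, dark blocks along the diagonal correspond to compact clusters, and the shading of the off-diagonal blocks shows how close neighbouring clusters are. Because the procedure only consumes the dissimilarity matrix and the cluster labels, it does not depend on the dimensionality of the original data.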

Similar resources

of “H-plots for displaying non-metric dissimilarity matrices.”

In this file, I show more results and examples from the paper “H-plots for displaying non-metric dissimilarity matrices”, together with more illustrative results on other databases: an image database of handwritten “3”s, the classical Iris data set, and the Swiss roll dataset, considered in Tenenbaum et al. [16] and Roweis and Saul [12]. Each database corresponds to a different section. This wo...

Dissimilarity Plots: A Visual Exploration Tool for Partitional Clustering

For hierarchical clustering, dendrograms provide convenient and powerful visualization. Although many visualization methods have been suggested for partitional clustering, their usefulness deteriorates quickly with increasing dimensionality of the data and/or they fail to represent structure between and within clusters simultaneously. In this paper we extend (dissimilarity) matrix shading with ...

A family of functional dissimilarity measures for presence and absence data

Plot-to-plot dissimilarity measures are considered a valuable tool for understanding the complex ecological mechanisms that drive community composition. Traditional presence/absence coefficients are usually based on different combinations of the matching/mismatching components of the 2 × 2 contingency table. However, more recently, dissimilarity measures that incorporate information about the d...

The Typhoon Tracks Analysis using Tri-plots and Markov chain

Based on the fractal dimension, tri-plots can classify two large time series datasets of unequal size. A tri-plot measures three function values: two self-plots and one cross-plot. Each self-plot characterizes one individual dataset, and the cross-plot describes the relation between the two datasets. Originally, tri-plots could only capture the relation in two data...

A statistical method for quantifying songbird phonology and syntax.

Songbirds are the preeminent animal model for understanding how the brain encodes and produces learned vocalizations. Here, we report a new statistical method, the Kullback-Leibler (K-L) distance, for analyzing vocal change over time. First, we use a computerized recording system to capture all song syllables produced by birds each day. Sound Analysis Pro software [Tchernichovski O, Nottebohm F...

Publication date: 2009